Dark energy constraints from lensing-detected galaxy clusters 
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We study the ability of weak lensing surveys to detect galaxy clusters and constrain cosmological 
parameters, in particular the equation of state of dark energy. There are two major sources of noise 
for weak lensing cluster measurements: the "shape noise" from the intrinsic ellipticities of galaxies; 
and the large scale projection noise. We produce a filter for the shear field which optimizes the 
signal-to-noise of shape-noise-dominated shear measurements. Our Fisher-matrix analysis of this 
projected-mass observable makes use of the shape of this mass-function, and takes into account the 
Poisson variance, sample variance, shape noise, and projected-mass noise, and also the fact that the 
conversion of the shear signal into mass is cosmology-dependent. The Fisher analysis is applied to 
both a nominal 15,000 deg^ ground-based survey and a 1000 deg^ space-based survey. Assuming a 
detection threshold of S/N = 5, we find both experiments detect « 20, 000 clusters, and yield 1-a 
constraints of Awo ~ 0.07, Awa ~ 0.20 when combined with CMB data (for flat universe). The 
projection noise exceeds the shape noise only for clusters at 2 < 0.1 and has little effect on the 
derived dark-energy constraints. Sample variance does not significantly affect either survey. Finally, 
we note that all these results are extremely sensitive to the noise levels and detection thresholds 
that we impose. They can be significantly improved if we combine ground and space surveys as 
independent experiments and add their corresponding Fisher matrices. 
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PACS numbers: 98.80.-k; 95.36.+x; 98.65.Cw; 98.62. Sb 



I. INTRODUCTION 

Constraining the dark energy equation of state and 
density parameters is the objective of many cosmologists; 
not few of them have considered using present or future 
cluster data to attain this goal. Cluster methods rely 
mostly on detection and counting of objects using some 
mass-indicating observable; for constraining cosmology 
there are 4 ways of finding clusters: optical emission 
by galaxies, the X-ray emission by the hot intraclus- 
ter medium, the Sunyaev-Zeldovich effect on CMB, and 
gravitational lensing. 

X-ray clusters received a lot of attention from cosmol- 
nsts 111 la, Saii, then SZ clusters 011 Eg. See 
Il2l [l3 | for thorough comparisons between the effi- 
ciency of extracting cosmological information from X-ray 
and SZ clusters and also for using complementary stud- 
ies of the CMB and supernovae distance measurements to 
improve constraints. For optically-detected cluster sur- 
veys and their results see the work of 0, 0, EE E3, Ell ■ 
The main issue with the X-ray, optical, and SZ clusters 
is the so-called mass-observable relation: cluster masses 
are not measured directly, but they are estimated from 
the real observables, such as the X-ray temperature or 
flux or the SZ integrated flux. But, as pointed out in 
|l2j| , the mass-observable relation can have non-standard 
redshift evolution and, if not carefully calibrated, the 
constraints on dark energy parameters are compromised. 
Cross-calibration of the mass-observable relations be- 
tween different types of surveys could, though perhaps 
not very efficiently, ameliorate the situation; more re- 
cently, self-calibration using the sample variance of the 
counts due to clustering of clusters has been proposed 
as a non-costly alternative to cross-calibration, eg see 
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Weak gravitational lensing detects clusters via the 
slight distortions imparted on the images of background 
galaxies. The lensing information is obtained from shear 
maps: clusters are detected using some filtering technique 
that finds the points where the signal-to-noise (S/N) is 
high enough for us to conclude that there is an over- 
density. The filtered shear is the mass observable. The 
relation of this observable to the projected mass is very 
simple and unambiguous. The strength of the lensing 
method is this lack of ambiguity in the mass-observable 
relation. 

A difficulty arises because such a point with high S/N 
does not necessarily correspond to a virialized cluster: it 
could also be the result of unvirialized large scale struc- 
ture projected along the line of sight, or the superpo- 
sition of multiple unrelated lower-mass objects |2ll l2a|. 
Even for virialized clusters, there will be substantial scat- 
ter between the projected-mass observable and the tra- 
ditionally defined virial mass ^3 . But there is in fact no 
underlying need for lensing observables to correspond to 
other cluster observables, nor even to a dynamical defini- 
tion of mass. We will argue that counts of projected-mass 
overdensities are just as useful for cosmology as counts 
of virialized halos. 

In this paper we study how well a 4— dimensional pa- 
rameter space can be constrained using weak-lensing- 
detected clusters. The parameters in question are 
(the matter density parameter), wq, if a and erg (the am- 
plitude of the matter power spectrum), wq and Wa de- 
fine the time-varying equation of state of dark energy: 
w{a) = Wo + {1 — a)wa- We use as examples two pro- 
posed future surveys, a ground-based Large Survey Tele- 
scope (LST) [2^ and the space-based Supernova Accel- 
eration Probe (SNAP) |2^. We first review the role of 



2 



cluster-mass observables in constraining cosmology. In 
ijllll we describe a filter based on S/N maximization of 
shear measurements that determines the minimum de- 
tectable mass of an object placed at a certain redshift. 
In ijlVl we calculate the cosmological constraints obtained 
when assuming that the intrinsic ellipticities of back- 
ground galaxies are the only source of noise. In ^\ we 
treat the large scale structure projection errors and in 
i)VII we draw some conclusions. 

The fiducial ACDM cosmological model for this whole 
paper is: flat universe, flm — 0.27, Cg — 0.9, Wq — — 1, 
Wa ^ 0, h = 0.72, Ofc/i^ = 0.024, consistent with the 
first- year W^AL4P results 0. 



II. MASS OBSERVABLES AND COSMOLOGY 

Any observable statistic can be used as a cosmological 
test, as long as (1) it can be measured on the real Uni- 
verse, and (2) its value can be predicted as a function 
of the cosmological parameters of interest. The utility 
of the statistic depends upon: the accuracy with which 
the measurement can be made; the accuracy with which 
the predictions can be made; and the sensitivity of the 
statistic to the parameters. To forecast the parameter 
accuracies, as we aim to do here, we need only estimate 
these three characteristics of the statistic. 

Overdensity-counting methods have as their statistic 
the distribution dN/dft dz dM of "clusters" in a solid an- 
gle vs redshift as a function of some observable quantity 
M [the clustering of these overdensities may be an addi- 
tional statistic]. In an ideal world, the observable would 
be the virial mass M, because there exist analytic frame- 
works [27112^ for predicting their distribution, a.k.a. the 
mass function. To constrain dark energy at interest- 
ing levels, the mass function must be predicted to an 
accuracy that will undoubtedly require A'^-body simula- 
tions, not just the analytic frameworks. For more recent 
work on mass functions and their accuracy see for in- 
stance [23, • This is also of course true for the real- life 
substitutes for virial mass: the x-ray flux/temperature, 
the SZ decrement, galaxy counts, and the lensing shear. 
There is no long-term advantage to an observable that 
is closely correlated to the virial mass. The ultimate 
utility of these methods will depend upon the fidelity of 
the numerical predictions, and here the lensing method 
is clearly superior. The shear prediction needs only the 
mass distribution, and 83% of the mass is easily-modelled 
collisionless dark matter. X-rays and SZ decrements de- 
pend fully upon the more complex baryon distribution, 
and the electron temperature as well. Cooling and den- 
sity fluctuations particularly affect the x-ray predictions. 
Galaxy counts are even more difficult to predict. It would 
be bold to assert that modelling will ever predict any of 
these observables other than the shear to the percent- 
level accuracies we will someday desire. 

Are analytic mass functions adequate for forecasting 
parameter accuracies? To be so, they must roughly — 



but not exactly — predict the number of peaks in M, so 
that our Poisson errors are properly estimated. When 
M is a projected mass measurement, several numerical 
studies j21j, show that up to tens of percent of de- 
tections can be "false positives" in the sense of having 
no corresponding virialized cluster, and some virialized 
clusters are missed. The Poisson statistics are grossly 
perturbed only when the S/N threshold is low enough 
that measurement-noise peaks overwhelm the mass sig- 
nals. 

The analytic model must also properly capture the 
dependence upon cosmological parameters. The "false 
positives" when M is the projected mass are not virial- 
ized, but they are real structures whose abundance will 
scale with erg and the linear growth rate in a manner not 
grossly different from the virialized structures ■ 

The projections which distinguish lensing-derived pro- 
jected masses from dynamical masses can be divided into 
two classes: first, there are projections between mass 
structures that are widely separated along the line of 
sight, in which case there is no angular correlation be- 
tween the projected halos. In f^we treat such projec- 
tions as a source of random noise on the mass deter- 
minations of detected clusters induced by projection of 
below-threshold halos. 

The second difference between lensing mass and virial 
mass is that the former includes all the structure along 
the line of sight that is correlated with the mass peak, 
not just the virialized (or unvirialized) core. The lensing 
"mass function" is therefore distinct from the mass func- 
tion of virialized halos, but it is just as well defined. We 
will simply assume for this paper that the projected-mass 
function is equal to the virial-mass function produced by 
the Press-Schechter formalism and related techniques, in 
the absence of numerical or analytic evidence that the 
two differ dramatically. 

We conclude that the distinction between the weak- 
lensing projected-mass observable M and the virial mass 
A/ will not be a barrier to its use as a precision cosmo- 
logical constraint, and that we can use the virial mass 
functions for approximate forecasts of these constraints. 
Naturally, for accurate constraints on dark energy, one 
will have to use a lensing mass function derived from 
numerical simulations. 



III. THE MINIMUM DETECTABLE MASS 

Given a cluster at some redshift, we would like to de- 
termine the smallest value of its mass such that it could 
be detected through weak lensing (WL) effects. In this 
section we assume that the intrinsic ellipticities of the 
galaxies represent the dominant source of noise for the 
projected mass measurement. We shall test this assump- 
tion in ilvl where we consider the large scale projection 
effects. 
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A. S/N maximization 

As mentioned in the introduction, for WL measure- 
ments the observable is the shear, which encodes informa- 
tion about the direction and magnitude of the distortion 
of background galaxies images. The components of the 
shear contain derivatives of the deflection angle with re- 
spect to the apparent position of the source galaxy. In the 
case of WL, distortions and magnifications are so small 
that we can very well approximate the relation between 
the components of the induced shear (71,72) and those 
of the measured image ellipticity (ei, 62) in the following 
way (see for example): 
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i e {1,2} 



(1) 



(61,62) represents the intrinsic ellipticity of the galaxy. 
From measurements of N galaxies, the shear can be es- 
timated with the accuracy: 



Var(7i) = ^ 



(2) 



where cr^ is the uncertainty in the measurement of one 
galaxy. For a detailed investigation of the shape noise, 
we refer the reader to |33j. An approximate expression 
is HI: 
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(3) 



A filtered shear map is used to detect clusters by select- 
ing the points where the S/N peaks above some thresh- 
old, (S'/A^)min- But the filtered shear values are also used 
in the interpretation of the detected clusters. We are at 
liberty to choose any filtering method we wish, provided 
we can tie its results to a physical quantity for which 
there exists a theory. We choose to normalize our filter to 
reproduce the virial mass when applied to canonical clus- 
ters, namely spherically symmetric objects, with NFW 
density profiles in the context of a ACDM cosmology — 
i.e. what we expect the average (if not typical) cluster 
profiles to be. This close scaling will allow us to use 
the well-studied and tested mass function theory to pre- 
dict the number of detectable clusters. Therefore, the 
two requirements for our filter are first that it yield a 
value closely related to the virial mass for canonical clus- 
ters; and second, that the virial mass estimator produce 
a maximum S/N from the shear map. 

For a circularly symmetric lens, the shear has only one 
component, tangent to the annulus about the cluster cen- 
ter. We define our mass estimator as a weighted sum of 
the shears in such annuli: 



M ^2_^ w{6k, Zi)^Ti9k, Zi) 

k.i 



(4) 



where k designates the annulus with angular radius 6k, 
^T{6k, Zi) is the measured shear of annulus k on sources 



in redshift bin i, and w{9k, Zi) is the weight on this shear. 
The variance of this estimator is: 

Var(M)= w{9k,z,)w{ek',Zj)Cov[-fT{9k,z,)jT{0k',Zj)]. 

k,k' 

(5) 

When the noise of the intrinsic ellipticities of galaxies is 
dominant, the contributions of different redshift bins to 
the estimator variance are uncorrelated and the above 
expression reduces to: 

Var(M) = ^*)' Var(7T(0fc, zO) . (6) 

k^i 

The desired maximization of S/N is obtained for the fol- 
lowing set of w's: 

w(Ok:Z,)(x—— — ^,Vk,i. (7) 

The normalization constant in equation Q is set by the 
first property of our filter: if the real clusters have NFW 
profiles and the real cosmology is ACDM, then the esti- 
mator must return the virial mass of the clusters. Thus, 
the constant is: 



C = 



Efe,(7^^^"'(^fe,^.))VVar(7T(0k,zO) 



(8) 



The shear and the optimal weight separate into a part de- 
pending only on the lens properties, and a part depending 
only on the lens and source redshifts. For redshift bin i 
we write: 

J^'^'' dZs'P{Zs)Z{Zs, Zd)^oo{zd) 

= ^ — j:rdz.viz^ — ■ 

In the above equation, 700 (2:^) is the shear of a hypothet- 
ical source at infinity, 'P{zs) is the redshift distribution 
of source galaxies (see ijlll B|l and Z is given by: 



if Z, > Zd 



Z{zs,Zd) Q otherwise 



(10) 



where Dds is the angular-diameter distance between the 
lens and the source and Dg is the angular-diameter dis- 
tance between theobserver and the source. Throughout 
this paper we consider only the case of a flat universe. 
In the limit of an inflnite number of redshift bins and if 
we introduce the shear per unit mass for the canonical 
cluster, 7oo, equation © becomes: 



7t(2i, Zd, 9) = MZ[zi, Zd) 7oo(zd, d) 



(11) 



To make the notation easier, we shall ignore the depen- 
dence on Zd of 7t and 700 . Using the normalization con- 
stant defined by equation ©, we obtain for the noise of 
the measurement: 



Var(M) = 



loo{9kYZ{zi,ZdY 
Var(7T(6'fc, Zi)) 



(12) 
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with 7oo and Z evaluated for the NFW profile and a 
ACDM cosmology. Var(77-(^?fe, z^)) is given by equa- 
tion ©I 

Var(7T(6'fc,z^)) 



where N{9k, Zi) is the number of galaxies in redshift bin 
i sheared in annulus k and 



Je 



dzgdip d^9V{zs)n{0, (p) . 



If the annuli are dense enough and using the above re- 
lations, as well as equations Q and ((SJ, we can rewrite 
the mass estimator of an NFW cluster as: 



(13) 



where the Pa's generically denote cosmological parame- 
ters. g{pa) is the change of the estimated mass from the 
true virial mass if the cosmology is different from ACDM: 



dZsVjZs) Jo'"" dOO^TjO, Zs,Zd)j^{0, Zs, Zd) 

dz,v{zs) J''- dee%{B, z,,zd) 



(14) 

7t(^, is the tangential shear per unit mass for a 

ACDM cosmology. 7™ is the shear one will measure in 
the real cosmology, possibly other than ACDM. 

Note that if 700 (^) is independent of cosmology, then 
g{pa) is simply a ratio of angular-diameter distances. 
More generally, the parameters of the NFW profile of 
a cluster of given A/vir will depend upon cosmology. 

The variance of the mass estimator, written also in the 
case of continuous annuli is: 



Var(M, Zd) = 

x[j^dz.,piz,,)z'iz,,zd)J^"-dee^/lie,zd)y\i5) 



Here we have considered a constant angular concentra- 
tion of galaxies. For the upper bound of the second in- 
tegral in the above equation we take = ^ , with 
Ryir the virial radius of the measured cluster. This choice 
of Oiim seems a reasonable trade-off for looking widely 
enough to get a significant signal without too much con- 
tamination by other lensing structures. 

A cluster is detectable if its mass estimator has a min- 
imum value of 



M„iin(zd) = (S'/iV)min VVar(M,Zd) 



(16) 



where (5'/A^)min is the detection threshold that we choose 
for our measurement and Var(M, Zd) is given by equa- 
tion ifT^ . 

Let us emphasize again that only in the case of a canon- 
ical cluster (NFW profile in a ACDM cosmology) does 



our estimator translate the shear signal into the cluster's 
virial mass. If the cosmology is changed from ACDM, the 
same shear signal results from a different virial mass, as 
one can see from equation H13|l . But even so we are still 
able to find and count objects. Since A^-body simulations 
will yield the functional dependence on cosmology of the 
difference between the estimated and the virial mass, we 
can still understand our counts and know how to extract 
cosmological constraints from them. The measurement 
is well-defined for clusters that do not have the canonical 
NFW profile, but in this paper we do not attempt nu- 
merical simulations to derive the effect of this difference 
on WL cluster counts. 



B. Distribution of sources and the NFW shear 

We now present a few steps that we take in order to 
compute the variance given by equation (|15|l . 

We assume the source galaxies to have the redshift 
distribution introduced by |35| and used in many other 
works, e.g. see [2lll3lll36(: 



^(^) = exp(-z/zo). 



(17) 



The mean of this distribution is (z) = 3zq ; the charac- 
teristics of the fiducial survey will set the value of zq. 

As already specified, we optimize our filter for spher- 
ically symmetric clusters with NFW profiles and in a 
ACDM cosmology. 

A thorough treatment of NFW lenses is given by , 
as well as by '38^ . The shear due to NFW mass distribu- 
tions is: 



7NFw(a;) 



[10/3- 



41n(l/2)] 



a; < 1 
X = 1 
a; > 1 , 



(18) 



where x = and is the scale radius of the halo. 

6c is the characteristic overdensity and pdz) — sttG ^ 
is the critical density of the universe. The characteristic 
overdensity is related to the concentration parameter of 
halos by the condition that the mean density within the 
virial radius Rvir should be AyirPc, where Ayir = 200 for 
NFW: Sc = ^ in(i+f)__o_ (see [H for more details). 

The functions 5< and 5> are independent of cosmology 
and of cluster parameters and are given by: 



9<i.x) 



8 tan 



4 tanh'" 



■ln(f)- 



(x2-l)VT^ 



(a:2_i)3/2 
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Given a cosmology and a halo of some mass and at some 
redshift, we need the concentration parameter c in order 
to compute the characteristic overdensity and the scale 
radius. We assume a relation between c and M as given 
by 



C. Mass thresholds for nominal surveys 

In order to see the redshift dependence of the WL 
mass thresholds provided by equations l|15|) and we 
consider the examples of possible two future surveys: a 
Large Survey Telescope (LST) from the ground, and the 
space-based Supernova Acceleration Probe (SNAP). We 
assume for both surveys = 0.3. For LST we use a 
distribution of galaxies with a mean redshift of 1, i.e. 
zq = 0.33. The angular concentration of sources is n = 30 
galaxies/arcmin^ and the survey area is ^ = ISOOOdeg^. 
In the case of SNAP, we take zq = 0.5, n = 100 
galaxies/arcmin^ and A — lOOOdeg^. For a fixed Zd we 
compute Var(M, z^) as given by equation ((T3|l and then 
we find the smallest value of M verifying This 
value gives Mmin{zd)- The solid fines in figure Q] show 
the detection thresholds for both instruments using the 
NFW definition of the virial mass. To compute the clus- 
ter abundance, we shall convert to the Sheth-Tormen def- 
inition of virial mass and Mmin will rise by approximately 
30% for Zd < 0.4. 

Not surprisingly, the detection threshold for the space 
telescope is much lower than that of the ground telescope 
(3 or 4 times for small redshifts and almost an order of 
magnitude for redshifts higher than 1). In compensation, 
LST covers a substantially larger area of the sky, so in 
the end both telescopes can detect similar abundances of 
clusters. 

Using equation H19|) . we obtain « 21500 detectable 
clusters for LST and « 17500 for SNAP if (S'/A^)min = 5. 
If {S/N)^in = 10, the numbers are 1600 and 2200 respec- 
tively. As the cluster mass function is very steep, the 
derived cluster counts and resultant cosmological con- 
straints are quite sensitive to assumptions about input 
noise levels (cr^, n, and Zd) and (5/A^)inin- 



IV. FISHER MATRIX CALCULATIONS 
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FIG. 1: Minimum detectable mass for LST and SNAP when 
(S/N)min = 5. The solid lines represent Mmin when the intrin- 
sic ellipticity noise dominates the measurement. The dotted 
lines represent Mmin when the projection noise is dominant. 
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FIG. 2: Cluster redshift distribution for LST and SNAP when 
(S/N)niin = 5. The solid lines correspond to the case when 
the shape noise dominates. The dotted lines show the clus- 
ter distribution when in addition to the shape noise we also 
include projection effects in the minimum detectable mass. 



In this section we use the Fisher information matrix 
to estimate how well WL cluster surveys can constrain 
the following parameters: ilm, CS) 'U'o E^nd Wa- First we 
compute the number of clusters detectable by our fiducial 
surveys. Then we calculate the Fisher matrix considering 
Poisson noise and sample variance noise. Throughout 
this section we hold to the assumption that the intrinsic 
ellipticities of galaxies are the main noise source in the 
cluster measurements. 



A. Cluster abundances 

As mentioned in section the identification of the 
projected mass with the virial mass allows us to calculate 
the number of detectable clusters as an integral of the 
mass function dv^dM ■ "^^^ number of clusters per unit 
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solid angle is 



dN 

In 



dz 



dV 
dz dfl 



d\nM 



M„,„(z) 



dN{M, z) 
dVdlnM 



(19) 



Here dV is the comoving volume element and Mniin(z) 
is the virial mass of a cluster still detectable at redshift 
z, obtained from equations H13|) and Ijltill from section 
illlll The comoving number density of halos is given 
dVdAnM c^lnM = -^f{v)dv, where p is the comoving 
matter density and f{v) is a semi-analytic dimensionless 

form of the mass function, v = Ssc is the critical 
density for spherical collapse and a^(M, z) is the linear 
density field variance, smoothed with a top-hat filter. 
a{M, z) is the only quantity in the mass function that 
depends on the dark energy equation of state through 
the growth factor: 

a{M,z) ^ D{z)a{M,z = 0) . 

We assume a Sheth-Tornien f{v) |2^: 



fsrii^) = AJ— [I + ia,,)-P] exp(-aj./2) 
with A = 0.3222, a = 0.707,p = 0.3, S^c = 1.69. 



B. The Fisher matrix 

The Fisher matrix calculations were completed for cells 
corresponding to pairs of bins in redshift and filtered 
shear {z,M}. This choice of binning has two major 
benefits. Firstly, we use a directly measured quantity, 
the (filtered) shear M, rather than an inferred mass, to 
describe the data. Thus we can monitor how both the 
number of objects in a bin and their masses change with 
cosmology, in accordance with equations ifT^ and (fH^ . 
Secondly, we retain information on the number vs M i.e. 
we use the shape of the mass function. Most papers deal- 
ing with cluster Fisher matrix estimations wash away the 
information given by the shape of the mass function by 
integrating the mass function over mass above threshold. 
We considered 50 Af-bins and 35 redshift bins, going from 
to 2. 

Ignoring the errors in redshift determinations (for both 
galaxies and clusters), bin counts are affected by 2 kinds 
of noise: Poisson noise and sample variance noise. The 
latter arises from the fact that the large scale structure of 
the universe correlates the number density of virialized 
objects in different volume elements. have studied 
the importance of sample variance noise relative to Pois- 
son noise in cluster surveys . They concluded that sample 
variance can dominate Poisson noise for low-mass clus- 
ters, M < 4 X IO^^Mq/H. 

The Poisson Fisher matrix is defined as: 



i=l 



(20) 



where Ni^a = dNi/dpa and Pa,P(3 can be any of the 
four parameters mentioned earlier. is the estimated 
number of clusters for bin i and / is the total number of 
bins. 

To account for the correlations between bins, we follow 
[19| and employ an approximation of the Fisher matrix 
from the limiting cases of Poisson dominance and sample 
variance dominance: 

F^p = C~^N^p + i Tr[C-i C^^ S,^] . (21) 

N is a vector of length I whose elements are the iVj 's from 
above. C is the covariance matrix (dimension / x /), a 
sum of the shot noise and the sample covariance noise: 
C ~ S + TV. A/" is a diagonal matrix with jV , on its 
diagonal. The sample covariance matrix is (e.g. j2(l|'l: 



5y = b,bjN,NjD,Dj 



d^k 
(2^ 



Wdk)W*{k)Pun{k). 



Here i and j are bin indices, Piin(fc) is the linear power 
spectrum and b is the linear bias as given by |28| : 



b{M, z) = l + 



1 



2p 



where a, p and Ssc have the same values as for the mass 
function. D is the linear growth factor and Wi{k) are the 
survey windows. As suggested in 20], we considered the 
survey windows a series of concentric cylinders (redshift 
slices) at comoving distance r^, of height Sri (Sri ^ J^i), 
all subtending the same angle 6s. In Fourier space such 
windows have the expression: 



Wi{k) = 2exp(ifc||ri) 



sin{k\\Srj2) Ji{k±ri' 



kiiSri/2 k±^ri0s 



where k\\ and k± are the components of k, parallel and 
perpendicular to the line of sight and Ji is the first order 
Bessel function. 



C. Results 

To the cluster constraints we have added CMB con- 
straints obtained for a Planck-type experiment with 65% 
sky coverage (Masahiro Takada, private communication). 
The temperature and polarization power spectra and the 
cross spectrum were computed using CMBFAST version 
4.5.1. The Fisher matrix was initially calculated for 
9 parameters: Wcdm, ^b, ^de, wq, Wa, n (primordial 
power spectrum index). As (amplitude of initial scalar 
fiuctuations), a (primordial running index), and r (op- 
tical depth at recombination). This 9-dimensional ma- 
trix was then projected into our 4-dimensional parameter 
space and the new matrix was added to the Fisher ma- 
trix given by equation (|21|l . Two tables corresponding to 
(S'/iV)inin = 5 and (S'/-/V)inin = 10 are shown below. 
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We first note that the constraints from the two exper- 
iments are remarkably similar. LST measures erg slightly 
better than SNAP, while the opposite is true for Wa, 
the parameter describing the evolution of the dark en- 
ergy equation of state. We expect this trend: due to 
its higher detection threshold, LST takes more informa- 
tion from the steep high-mass end of the mass function, 
where a variation in as is very acutely felt. On the other 
hand, SNAP can see higher redshift clusters and test a 
deeper survey volume than LST, so is more sensitive to 
the evolution of the dark energy equation of state. 

The effect of sample variance on the dark-energy con- 
straints is minimal, < 15% for the SNAP survey and 
< 5% for LST. Adding CMB information brings down 
the constraints on all parameters, but especially on Wa, 
by w 50% for both instruments. This is further improved 
(by 40%-50%) if we consider LST and SNAP as indepen- 
dent experiments (assuming they probe different survey 
volumes) and just sum their Fisher matrices. This can 
be seen best in figure O where we have plotted the con- 
straints on the dark energy equation of state parameters 
for both telescopes when (S'/A^),„in = 5. 

Finally we note that dark energy constraints depend 
strongly on choice of threshold {S/N)^in. DoubHng the 
detection threshold to 10 degrades the cosmological con- 
straints by a factor ^ 2 for the SNAP+CMB case, or 
factors of ~ 3 for LST+CMB. 

We conclude that both instruments have similar per- 
formances and that, at least for constraining dark energy, 
cluster counts are useful when combined with other ex- 
periments. Our results are far less optimistic than the 
predictions given by |40l | , but we shall defer comparisons 
with other papers to the last section. 



(S/N^in = 5 


SNAP 




Poisson 


Poisson+SV 


Add CMB 




0.005 


0.006 


0.005 


0-8 


0.006 


0.007 


0.006 


Wo 


0.078 


0.087 


0.069 


Wa 


0.304 


0.348 


0.197 




LST 




0.004 


0.005 


0.004 




0.004 


0.004 


0.004 


wo 


0.076 


0.077 


0.062 


Wa 


0.373 


0.380 


0.182 




LST+SNAP 




0.003 


0.003 


0.003 


0-8 


0.003 


0.003 


0.003 


Wo 


0.043 


0.046 


0.038 


Wa 


0.197 


0.213 


0.125 



TABLE L 1-a constraints obtained for {S/N)min ~ 5, con- 
sidering only the intrinsic ellipticity noise. 



(S/N),„i„ = 10 


SNAP 




Poisson 


Poisson+SV 


Add CMB 




0.014 


0.014 


0.012 


0"8 


0.013 


0.013 


0.011 


Wo 


0.203 


0.204 


0.147 


Wa 


0.915 


0.940 


0.397 




LST 




0.015 


0.015 


0.013 


0"8 


0.011 


0.011 


0.008 


Wo 


0.268 


0.269 


0.190 


Wa 


1.590 


1.600 


0.548 




LST+SNAP 




0.007 


0.007 


0.007 


0-8 


0.006 


0.006 


0.005 


Wo 


0.121 


0.123 


0.092 


Wa 


0.644 


0.661 


0.268 



TABLE II: 1-a constraints obtained for {S/N)min ~ 10, con- 
sidering only the intrinsic ellipticity noise. 



V. LARGE SCALE STRUCTURE 
PROJECTIONS 

It has been anticipated that WL cluster measurements 
will be substantially compromised by the so-called pro- 
jection effects: the lensing signal could be produced by 
any structures along the line of sight, not just virialized 
clusters. This effect causes uncertainties in cluster mass 
determinations and consequently in cluster abundancies 
(e.g. (41] for a comprehensive discussion of errors on 
mass estimations and solutions to reduce them; see also 
m> S H)- HH study numerically the efficiency of lo- 
cating clusters in shear maps and conclude that about 
15% of the most significant peaks detected in noiseless 
WL maps do not have a collapsed halo with mass greater 
than lO^^-^MoJh within a 3' aperture; also see the re- 
lated work of |23| . 

The purpose of this section is to determine how impor- 
tant the large scale projections are for cluster counting 
and parameter constraints. We shall first consider the 
lensing signal of clusters with masses below the detection 
threshold computed in ijllll as the only source of noise 
for the projected mass measurement (i.e. we ignore com- 
pletely the intrinsic shape noise of galaxies). We shall 
also regard the signal of these smaller clusters as wholly 
uncorrelated with the signal of detectable clusters em- 
ployed for the estimates in ^IVI presuming them to be at 
widely separated redshifts. Given these assumptions, we 
would like to establish the minimum detectable mass of 
a cluster at an arbitrary redshift Zd- We follow the steps 
of section ^lllll to calculate the variance of the mass esti- 
mator M, replacing the intrinsic-ellipticity noise with the 
projection noise. We retain the weights derived in 
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by optimizing the S/N for the intrinsic-eUipticity noise. 
Although this is no longer an optimal filter, it allows us 
to compare the shape noise and the projection noise and 
to establish the redshift regime where each of them dom- 
inates. 



A. The mass estimator 

When the noise of the small structures along the line 
of sight dominates our measurement the contributions of 
different source-redshift bins are correlated, so we need 
to calculate the tangential shear power spectrum of these 
bins, as indicated by Q. Since the convergence power 
spectrum is easier to estimate than the tangential shear 
spectrum, we reexpress the mass estimator in terms of 
convergence rather than shear: 

M= f (feJ2^-i<^^^M0,z^), (22) 

i 

where we have considered directly the case of continuous 
annuli. The new convergence weights are linked to the old 
shear weights in the following manner (see for instance 
0): 

[x) = 2 r dy - (x) . (23) 

J X y 

As already said, we take the same shear weights that 
optimize the shot noise filter and manipulating a little 
equation 123|) we obtain for the convergence weights: 

^ [k(6',z) - k(6'h™,z)] , e<0u^ 
w^i9,z) = { -it (24) 

0, 6 > 0iirn , 



with 9iim defined in ijllll and the mean convergence inside 
a radius 9, R{9,z) = ^ dxxK{x,z). Just like in ^IIII 
we shall use the convergence of a canonical cluster with 
an NFW profile density. C is the constant defined by 
equation ijHJi. 

We write the estimator defined by (|22ll in Fourier space 
and take its variance: 

Var(M) = -i^ J2 f ^«(^^ ^«'(^) "'«(^^ ' 

(25) 

where we have used the definition of the convergence 
power spectrum: 

{k{1, z,) k{P, Zj)) = (27r)2 Pl^H) Soil- P) , 

and the Fourier transform of the convergence weight: 



w^{l,z) = 2Tr d9 9w^{9,z)Joil9). 
Jo 



The power spectrum for bins i, j is: (see for instance j4^]) 

2 



K^(0 



dZn 



dx 



dZri 



{1+Zj,f 



X W^{zp)Wj{zp)Ps 



Ps is the 3D matter power spectrum and x is the co- 
moving distance. The source weights are given by the 
expression: 



dzsVizs) Z{zs, z) 

" /;;-^ dz,v{z,) 



In the limit of infinite number of redshift bins this sim- 
plifies to: Wi{z) = Z{zi, z) . 

Combining all these ingredients and using (EH), the 
projection noise is thus defined: 



2 c2 



Var(M,Zd) -^a 



X / dH {d99[K^{9,Zd)-Roo{SHm,Zd)]Ml9)Y 



dZr, 



dx 



dz„ 



(1 -t- ZpfPs 



I 



Xizp) 



1 2 



dZs Z{Zs,Zd)Z{Zs, Zp)'P{Zs) 



(26) 



The nonlinear matter power spectrum is computed us- 
ing the halo model [43: it is the sum of a quasi-linear 
term and a halo term. The quasi-linear term gives the 
power resulting from correlations of distinct halos and 
dominates on large scales. The halo term describes the 
correlations of particles within the same halo and domi- 
nates on small scales. At every Zp (projection redshift), 
Ps is estimated as the power of the virialized structures 
with masses smaller than M^in{z) from section ijllll Re- 
call that halos above this mass are detected as clusters 
and are hence part of the signal, so do not contribute to 
the noise variance in the cluster-counting experiment. 



B. Projection Noise vs. Ellipticity Noise 

Equation l|26(l represents the projection noise, caused 
by the "unseen" small clusters that our featured surveys 
cannot detect, but which contribute nonetheless to the 
whole lensing signal. Solving again equation H16|l with 
the variance given by we find M^°^'{z) associated 
with this noise. In figure^we have plotted the minimum 
detectable mass corresponding to the shape noise (solid 
line) and to the projection noise (dotted line) for LST 
and SNAP when (5'/iV)„iin is 5. We note that projection 
noise exceeds intrinsic-shape noise only for z < 0.07 for 
LST, or slightly higher (z < 0.11) for SNAP due to its 
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lower shape-noise level. If (S'/A^)min=10, the crossover 
redshifts are 0.1 and 0.4, respectively. 

There are 2 factors that decide the function M^^°^'{z). 
Its shape is determined by the sources' redshift distribu- 
tion and its amplitude by the magnitude of the power 
spectrum that we integrate in H26(l . To understand why 
the projection noise is so high at small redshifts com- 
pared to the shape noise, we have to look at the way 
S/N scales with Dd (the lens-observer angular diame- 
ter distance) in each case. At small redshifts, the signal 
gets weak, because the shear gets weak. When the shape 
noise dominates, S/N does not have a dependence on 
Dd because the signal scales with the same way the 
noise does. This is not true for projection noise, where 
the noise and the signal scale differently, so 5'/A^ ^ 
as z at fixed mass. A cluster produces a maximum 
lensing signal if it lies halfway between the source and 
the observer: M^°^^ is minimal at the redshifts obeying 
this condition. For LST, (z) is 1, so the minimum occurs 
around 0.5. For SNAP it occurs at redshifts around 0.7, 
since (z) is 1.5 in this case. At higher redshifts LST has 
fewer sources than SNAP, so there Af^™-' is higher for 
LST than for SNAP. 

^min is even more sensitive to the choice of the detec- 
tion threshold than M^^:^^^'^. In the case of shape noise, 
the variance of the mass estimator does not depend on 
(5/-/V)„iin, as one can see from equation (IT^ . (5/A^)„iin 
impacts M^^° only when we solve ((TE|) . In the case of 
projection noise, the variance itself depends on (5/iV),nin 
through the power spectrum in equation l|26|) . For every 
projection redshift, we have excluded from the total mat- 
ter power spectrum the clusters with masses bigger than 
M^;^P° corresponding to that redshift, because they are 
individually identifiable as clusters and can be removed 
from the noise background. Therefore, the remaining 
noise spectrum is stronger when (5/A^)niin = 10 than 
when (5/A^)inin — 5. In the same vein, M^°^^ is greater 
for LST than for SNAP, because M^^^"^ is greater for 
LST than for SNAP. The importance of (5'/iV)mi„ is fur- 
ther propagated when we solve equation (|16|) for the vari- 
ance given by l|26|l . This is the reason why the difference 
between MP™^ obtained for (S'/A^)min = 5 and M^°J for 
(S'/iV),„in — 10 is greater than the difference between 
M^^° corresponding to the same detection thresholds 
of 5 and 10. 

In order to see the role of the projection noise in the 
cosmological parameter constraints, we have added the 
shape-noise critical mass and the projected-noise critical 
mass in quadrature and repeated the calculations from 
i)IVI And just like in illVI we took the variation with 
cosmology of the new total critical mass into account. 
FigureElshows the number of detectable clusters per unit 
redshift as a function of redshift when we consider only 
the shape noise (solid line) and both the shape and pro- 
jection noises (dashed line). The total number of clus- 
ters does not decrease significantly (< 10%) when the 
projection noise is included, if (5/^^)111111 5. There is 



negligible change to the derived cosmological constraints. 

Raising the threshold to lOo" can cut the number of 
detectable clusters more severely, particularly for SNAP, 
where it eliminates 60% of detections. The effect upon 
derived parameter constraints remains small, however, if 
we combine the cluster surveys with either the CMB or 
with each other. Figure 01 shows the constraints on wq 
and Wa obtained for SNAP first for ellipticity noise only 
and then when projections are included. 

If the projection noise were to become a significant 
contributor to the error budget, one could construct an 
estimator that maximizes the signal-to-noise in the pres- 
ence of both shape noise and projection noise [4l|, |4J| . 




-1.4 -1.2 -1 -0.8 -0.6 



FIG. 3: Poisson-|-sample variance constraints on wo and Wa 
when (S'/A'^)min ~ 5: From outermost to innermost, CMB 
is black, SNAP is green, LST is red (dashed), SNAP-I-LST 
is blue and finally the cyan ellipse is obtained by combining 
SNAP+LST-I-CMB. The intrinsic ellipticities of galaxies are 
the dominant noise here and we have marginalized over flm 
and (Jg. 



VI. DISCUSSION AND CONCLUSIONS 

In this paper we have tried to determine how use- 
ful WL-detected clusters are for constraining cosmology. 
We focus our attention on the matter density parameter 
(rim) J the power spectrum normalization (cts) and the 
time-evolving dark energy equation of state (wq, Wa), as- 
suming a flat universe. 

There are a few important features in our Fisher ma- 
trix analysis that distinguish it from previous work. We 
compute the Fisher matrix using as an observable the 
directly measured quantity, the flltered shear M. As- 
suming some density profile for the detected lenses, their 
measured shear can be converted to a virial mass. How- 
ever, this conversion depends on the cosmology. In differ- 
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FIG. 4: SNAP constraints on wq and Wa'- the orange (outer) 
solid ellipse is for (S/A'^)min = 10 and shape noise only, the 
green (inner) solid ellipse is for {S/N)min = 5 and shape noise 
only. The dotted ellipses are obtained for {S/N)niin ~ 5 and 
when we add the projection noise to the shape noise. These 
are all Poisson+sample variance errors and we have marginal- 
ized over flm and as- 





SNAP 




Poisson 


Poisson+SV 


Add CMB 




0.006 


0.006 


0.006 


crs 


0.006 


0.007 


0.006 


Wo 


0.082 


0.090 


0.072 


■Wa 


0.315 


0.355 


0.203 




LST 




0.005 


0.005 


0.005 


0"8 


0.004 


0.004 


0.004 


Wo 


0.077 


0.078 


0.063 


Wa 


0.380 


0.386 


0.185 




LST+SNAP 




0.003 


0.003 


0.003 


0-8 


0.003 


0.003 


0.003 


Wo 


0.044 


0.047 


0.038 


Wa 


0.201 


0.216 


0.126 



TABLE III: 1-cr constraints obtained for {S/N)min = 5 when 
we add the projection noise to the intrinsic ellipticity noise. 



ent cosmologies, different masses correspond to the same 
measured shear. Our Fisher matrix takes this fact into 
account. We have also employed the shape of the mass 
function, by keeping track of the M of the objects in bins 
rather than merely counting clusters above a threshold. 
A third distinction of our analysis is the halo-model cal- 



culation of the noise variance due to uncorrelated pro- 
jections along the line of sight. As noted above, this is 
shown to cause little loss of cosmological information in 
most circumstances. 

When we apply our formalism to canonical ground 
(LST) and space (SNAP) WL surveys, we find the two 
are nearly equivalent. The lower cluster-mass threshold 
afforded by the deeper sample of source galaxies in space 
provides a 15-fold increase in the sky density of detected 
clusters, which compensates for the smaller survey solid 
angle that we assume for the SNAP survey. Both sur- 
veys produce quite interesting dark-energy constraints, 
and improve significantly when combined with CMB con- 
straints. Combining the LST and SNAP surveys results 
in constraints significantly stronger than either alone. 

Comparison with previous forecasts for WL cluster sur- 
veys is complicated by the extreme sensitivity to the as- 
sumed noise level and detection threshold of the survey — 
this behavior is of course generically true of cluster- 
counting experiments. 

Our results are less encouraging than some other fore- 
casts in the literature: for a detection threshold of 5 and 
considering only the intrinsic ellipticities of galaxies, LST 
could detect about 21500 clusters. The calculated LST 
errors on cosmological parameters are: A51m = 0.005, 
Acts = 0.004, Awq = 0.08, Aw^ = 0.38, if we take into 
account both the Poisson and sample variance noises. 
SNAP yields rather similar constraints. 

These values obtained for the dark energy equation 
of state are a few times higher than found by [40l |: 
Aiuq = 0.05 and Awq = 0.09. These authors analyze 
an LST- type experiment, using a Gaussian filter for the 
shear signal, as proposed by |23| • Such a difference in our 
results is partly (but not completely) explained by the as- 
sumed parameters of their survey. The angular concen- 
tration of galaxies is 65 galaxies/ arcmin^, the survey area 
IS 18000deg^ and tJ^Y is 0.15, more optimistic than our cor- 
responding values of 30 galaxies/ arcmin^, 15000deg^ and 
0.3. Additionally, they take (S'/iV)inin = 4.5. They es- 
timate about 200000 clusters are detectable, 10 x higher 
than our nominal estimate. If we assume their input 
values, our optimal filter yields about 700000 clusters, 
demonstrating the advantage of an optimal filter. 

The last issue that we discussed is that of the projec- 
tion contamination, considered the sword of Damocles for 
WL measurements. From the beginning we make a cru- 
cial approximation: we identified the projected mass of 
clusters with their virial mass. This need not be true on 
a cluster-by-cluster basis, just in the sense that the mass 
functions have similar amplitude and dependence upon 
cosmological parameters. With this assumption — which 
needs to be verified numerically — about the effect of local 
structure on projected-mass estimates, we can next treat 
the projection of unrelated structures along the line of 
sight as a noise source, not a bias. 

WL cluster measurements are subject to two major 
sources of noise: the intrinsic ellipticities of galaxies 
(shape noise) and these large scale projections. We find 
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that, even when we use an overdensity-detection filter op- 
timized for pure shape noise, that the projection noise is 
dominant only for lenses at < 0.1, and has little effect 
upon the derived dark-energy constraints in most appli- 
cations of our canonical ground (LST) and space (SNAP) 
WL cluster surveys. 

Let us stress again that all these numbers are extremely 
sensitive to the detection threshold and the noise levels 
that we impose. If (5'/A^)min = 10, the number of de- 
tectable clusters goes down by a factor of 10 in the case 
of shape noise only and by more than a factor of 18 when 
we also account for projections. Then the errors on the 
dark energy equation of state parameters are as much as 
tripled. If the noise has Gaussian statistics, as we would 
expect for shape noise, then the 5cr threshold will suf- 
fice to keep false positives to an unimportant level. The 
projection noise will not, however, be Gaussian, so fur- 
ther investigation is required to determine whether the 
5(7 threshold offers enough suppression. The numerical 
study of suggests this is the case, as their 4.5ct thresh- 
old results in a false-positive contamination of only 25%. 
Recall that this work considers any object not associated 
with a virialized halo to be a false positive, whereas we 
assert that counting unvirialized objects is a valid cos- 
mological test, so this 25% "contamination" still carries 
cosmological information. 

We conclude, therefore, that counting of WL-detected 
clusters can be a powerful constraint on cosmology with 



either space- or ground-based surveys. While projection 
effects make it difficult to establish a one-to-one corre- 
spondence between WL-derived masses and virial masses, 
there is no real need to make such a correspondence in 
order to infer cosmological parameters from WL data. In- 
deed it is precisely the ability to go directly from A^-body 
simulations to WL-derived masses that makes WL clus- 
ter counting an attractive alternative to optical. X-ray, 
or SZ cluster counting. Our Fisher analysis incorporates 
the cosmological dependence of the conversion from shear 
to mass units, plus the effect of sample variance and pro- 
jection noise, and we show that none of these are barriers 
to strong dark-energy constraints. 
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